XOR - XML Oriented Retrieval Language
Abstract
The wide acceptance and rapidly growing use of XML as a standard data format for storage and retrieval blurs the historical divide between Information Retrieval and Database Retrieval. On the structured database retrieval side, it is now possible to support highly structured access to documents using XML-specific tools such as XPath, XQuery, XQL and more. On the information retrieval side, it is possible to support access to XML documents using XML-specific retrieval query languages such as NEXI. None of the above are intended for end users; they are enabling back-end technologies. In this paper we introduce XOR, a new XML Oriented Retrieval language designed to facilitate query specification with a strong IR flavour. XOR is backwards compatible with NEXI, but significantly extends its functionality, overcoming many of its restrictions and limitations. While XOR itself is not an end-user tool, it is designed with the explicit goal of supporting IR and, more specifically, user-oriented interfaces such as Natural Language Queries (NLQ) or interactive user interfaces. XOR provides functionality that advanced IR requires and that none of the existing XML retrieval tools support.
Similar resources
XML Retrieval
DEFINITION Text documents often contain a mixture of structured and unstructured content. One way to format this mixed content is according to the adopted W3C standard for information repositories and exchanges, the eXtensible Mark-up Language (XML). In contrast to HTML, which is mainly layout-oriented, XML follows the fundamental concept of separating the logical structure of a document from i...
A Novel Watermark Algorithm for Integrity Protection of XML Documents
With the fast development of Extensible Markup Language (XML) and its comprehensive application, the integrity protection of XML documents is becoming pressing. A traditional method for this is digital signature. In this paper, however, based on watermark techniques, we propose a novel solution to the integrity protection of XML documents. In this scheme, watermarks are generated through applyi...
Le langage de requêtes XFIRM pour la recherche d'information dans les documents XML
One of the key advantages of XML is its capacity to combine structured and unstructured (text) data. Many languages, based on database-oriented approaches or on information retrieval-oriented approaches, have been proposed in the literature for querying XML corpora. However, they emphasize either document structure querying or text querying, and they do not manage to combine the two in an attr...
XML Information Retrieval and Information Extraction
We present a new query language for information retrieval in XML documents and discuss its combination with information extraction methods. XIRQL is an XML query language which implements IR-related features such as weighting and ranking, relevance-oriented search, datatypes with vague predicates, and structural relativism. For information extracted from texts, XIRQL can rank records based on u...
XML Information Retrieval - Achievements and Challenges
Data-centric view: XML as exchange format for structured data. Document-centric view: XML as format for representing the logical structure of documents. This talk: focus on document-...